Process for the production of D-alpha-amino acids.

A process is described for the production of D-α-amino acids by the stereospecific conversion of racemic
mixtures of 5-substituted hydantoins, where said conversion is carried out in the presence of a microorganism
transformed with a plasmid capable of expressing, at high levels and without inducers, an enzymatic system
capable of converting said hydantoins into the corresponding D-α-amino acids.

A plasmid is also described comprising the genes which encode said enzymatic system, together with a
microorganism selected from Escherichia coli or Bacillus subtilis transformed with said plasmid.

D-α-amino acids are intermediates useful in the preparation of pharmacologically active substances,
pesticides and sweeteners.

The present invention relates to a process for the production of D-α-amino acids by the stereospecific
conversion of racemic mixtures of 5-substituted hydantoins with a microorganism transformed with a
plasmid capable of expressing in high yields and without inducers an enzymatic system capable of directly
converting said hydantoins into the corresponding D-α-amino acids.

The term enzymatic system refers to a system consisting of D-hydantoinase and D-N-carbamylase
enzymes.

D-α-amino acids are extremely valuable compounds useful for the preparation of pharmacologically
active substances (for example, D-phenylglycine and D-parahydroxyphenylglycine are used in the synthesis
of penicillins and cephalosporins), pesticides (D-valine for the synthesis of the insecticide fluvalinate) or
sweeteners (D-alanine).

The preparation of these enzymatic systems, however, requires the use of efficient inducers capable of
stimulating the production of these enzymes by the microorganisms. It is, in fact, known that the
expression level of the enzymes D-hydantoinase and D-N-carbamylase is constitutively very low (Syldatk et
al. (1990), "Advances in Biochem. Engineering/Biotechnology" (Fiechter, A. Ed.), 41, pages 29-75,
Springer-Verlag, Berlin).

The inducers normally used are derivatives of hydantoins or nitrogenated cyclic compounds which are,
however, easily metabolized by the microorganisms, or compounds such as uracil, thio-2-uracil or
thymine which are not metabolized (Meyer et al., (1993), FEMS Microbiol. Letters, 109: 67-74).

The use of inducers creates a series of drawbacks, among which are an increase in the production costs
and a certain variability in the production yields of the enzymes. In addition, the expression level which can
be obtained in most microorganisms following induction is insufficient for economical use in industrial
processes (Syldatk et al. (1987), Biotechnol. Lett., 9: 25-30; Yokozeki et al. (1987), Agric. Biol. Chem., 51,
715-722).

Recently the genes which encode the enzymes D-hydantoinase and D-N-carbamylase have been
individually sequenced and cloned (US 4.912.044 and EP-515.698).

More specifically, patent US 4.912.044 describes the preparation of D-hydantoinase by the fermentation
of a microorganism transformed with a hybrid vector containing the hydantoinase gene whose expression is
induced by temperature variation. The enzyme thus obtained is used for the production of D-N-carbamyl
derivatives from 5-substituted hydantoins.

Patent application EP-515.698 describes, on the other hand, the preparation of D-N-carbamylase by the
fermentation of a microorganism transformed with a plasmid comprising the carbamylase gene whose
expression is chemically induced with IPTG. The enzyme thus obtained is used for the production of
D-α-amino acids from N-carbamyl derivatives.

As industrial interest is directed towards the conversion of racemic hydantoins to D-α-amino acids, the
fact that the two enzymes are expressed in different strains involves the use of both and consequently the
development of a process starting from two distinct fermentative processes.

This obviously increases the production costs and slows the conversion kinetics. In fact, in order to
complete the enzymatic reaction, the N-carbamyl derivative produced by the transformed microorganism
containing the hydantoinase must pass through the bacterial membrane, diffuse into the reaction medium
and then proceed in the opposite direction to reach the second enzyme (carbamylase) present in the other
strain. All this is particularly penalizing from the point of view of kinetics, considering the reduced
permeability of the bacterial membranes to the carbamyl derivatives (Olivieri et al. (1981), Biotechnol.
Bioeng., 23, 2173-2183) and the inevitable dilution of the carbamyl derivative itself in the reaction mixture.

Finally, the use of a double volume of biomass has a negative influence on the yields and degree of
purity of the final product.

In addition, the necessity of having to induce the expression of these enzymes creates a further 
problem thus making these processes of little interest for practical use. 

The object of the present invention is to overcome the disadvantages of the known art described above.
In particular it has now been found, in accordance with the present invention, that the use of a particular
plasmid which contains the genes of D-hydantoinase and D-N-carbamylase put under the control of an
appropriate synthetic promoter enables the high expression of these enzymes to be obtained without
inducers.

It is therefore possible to prepare a single microorganism transformed with said plasmid containing the
two enzymatic activities inside. This solution solves not only the problems relating to kinetics due to the
limited permeability, as the two reactions occur inside the same cell where the concentration of the
substrates is optimal, but also those relating to the requirement of inducers and treatment of the product
and of the waste products.

In accordance with this, a first aspect of the present invention relates to a process for the production of
D-α-amino acids by the stereospecific conversion of racemic mixtures of 5-substituted hydantoins,
characterized in that the conversion reaction is carried out in the presence of a microorganism transformed with
a plasmid capable of expressing at high levels and without inducers an enzymatic system capable of
converting said hydantoins into the corresponding D-α-amino acids.

A further object of the present invention is the plasmid pSM651 comprising the genes which encode the 
enzymatic system. 

Yet another object of the present invention is a microorganism transformed with the plasmid pSM651
capable of expressing with high efficiency and without inducers an enzymatic system capable of
stereospecifically converting racemic mixtures of 5-substituted hydantoins into the corresponding
D-α-amino acids.

A further object of the present invention relates to the use of said microorganisms or enzymatic system
isolated from said microorganisms for the production of D-α-amino acids by the stereospecific conversion of
racemic mixtures of 5-substituted hydantoins.

Further objects of the present invention will be evident from the description and examples below.

Brief description of the figures 

Figure 1: Map of the plasmid pSM637 containing the carbamylase gene.
Figure 2: Map of the plasmid pSM650 containing the hydantoinase gene.
Figure 3: Map of the plasmid pSM651 containing the hydantoinase-carbamylase operon.
Figure 4 A-B: Nucleotide and amino acid sequence of carbamylase.
Figure 5 A-C: Nucleotide and amino acid sequence of hydantoinase.
Figure 6: SDS-PAGE (A) and Western blot (B) of the total proteins extracted from cultures of E.coli and
B.subtilis transformed with the plasmid pSM651, wherein:
lane 1: standard hydantoinase
lane 2: standard carbamylase
lane 3: E.coli (pSM671) control
lane 4: E.coli SMC305
lane 5: B.subtilis (pSM671) control
lane 6: B.subtilis SMS373

The genes which encode the D-hydantoinase and D-N-carbamylase enzymes can be isolated from
microorganisms such as Pseudomonas, Hansenula, Agrobacterium, Aerobacter, Aeromonas, Bacillus,
Moraxella, Brevibacterium, Flavobacterium, Serratia, Micrococcus, Arthrobacter or Paracoccus. Specific
examples of these microorganisms are Bacillus macroides ATCC 12905, Aerobacter cloacae IAM 1221,
Agrobacterium sp. IP 1-671, Agrobacterium radiobacter NRRL B-11291, Pseudomonas sp. FERM BP 1900.

The isolation of the genes which encode the D-hydantoinase and D-N-carbamylase enzymes can be
carried out by the construction of a gene library representing the genome of the microorganism,
identification of the clones containing the genes which encode said enzymes, analysis of the gene
sequence, insertion of said genes into a vector and verification of their expression.

The term gene library or genome bank means the combination of clones of a given host microorganism,
each of which carries a fragment of the chromosomal DNA deriving from the donor organism of which the
bank is to be obtained. A bank is defined as being representative when the combination of the single
fragments contained in each clone forms the majority of the chromosomal DNA of the donor organism.

According to a preferred embodiment of the process of the present invention, the strain A.radiobacter
NRRL B-11291 is used as donor organism for the isolation of the genes which encode D-hydantoinase and
D-N-carbamylase.

In practice, two genome banks of said microorganism are constructed in E.coli by digesting the
chromosomal DNA separately with the restriction enzymes BamHI and SacI. Among the fragments obtained
with the two digestions, those having dimensions of between 3,000 and 4,500 bp are purified. This size
range is selected on the basis of the estimated molecular weights of the D-hydantoinase and D-N-carbamylase
enzymes, 50,000 and 34,000 Daltons respectively.
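
The link between enzyme size and fragment size can be checked with a rough calculation. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not stated in the text; the function name is illustrative only.

```python
# Rough estimate of the coding length needed for a protein of known mass.
# ASSUMPTION: ~110 Da per amino acid residue (rule of thumb, not from the text).
AVG_RESIDUE_MASS_DA = 110.0


def estimated_gene_length_bp(protein_mass_da, residue_mass_da=AVG_RESIDUE_MASS_DA):
    """Approximate coding length in bp (including one stop codon)."""
    residues = round(protein_mass_da / residue_mass_da)
    return 3 * (residues + 1)


for name, mass in (("D-hydantoinase", 50_000), ("D-N-carbamylase", 34_000)):
    print(f"{name}: about {estimated_gene_length_bp(mass)} bp of coding sequence")
```

Under this assumption both genes need well under 1,500 bp each, so a 3,000 to 4,500 bp fragment leaves room for a complete gene plus flanking sequence.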

The two populations of BamHI and SacI fragments are then ligated to a vector of E.coli under such
conditions as to favour the ligation of a single fragment to each molecule of the vector. The two
ligase mixtures are used to transform cells of E.coli made competent as shown for example by Dagert, M.
and Ehrlich (1979), (Gene, 6:23).

The two populations of colonies (genome banks) thus obtained, each of which carries a hybrid plasmid,
i.e. one consisting of the molecule of the vector and a chromosomal DNA fragment of A.radiobacter, are then
screened to identify those clones containing the hydantoinase and carbamylase genes.

The identification can be carried out by direct expression or using specific probes. The second method
is preferably used. For the selection of the probes, in the case of carbamylase, reference was made to the
known amino-terminal sequence of the carbamylase of Comamonas sp. E222c (Ogawa et al. (1993), Eur.
J. Biochem., 212: 685-691).

On the basis of this sequence small oligonucleotides are synthesized which, once labelled, are used for
the screening of the gene libraries by hybridization techniques (Maniatis et al., (1982), "Molecular Cloning: a
laboratory manual", Cold Spring Harbor Laboratory).

This permitted the identification of a clone carrying a hybrid plasmid with a BamHI fragment
containing the nucleotide sequence which encodes the whole carbamylase. Analysis of said plasmid
showed, in addition, the presence of a second incomplete ORF, placed on the other strand with respect to
the carbamylase gene, which showed a homology with urease portions isolated from various
microorganisms.

As ureases, like hydantoinases, are enzymes belonging to the group of amidohydrolases, it was
assumed that the incomplete ORF corresponded to that of hydantoinase. This assumption was then confirmed
by the enzymatic activity tests carried out on cellular extracts of cells carrying the identified gene.

In order to isolate the whole nucleotide sequence encoding hydantoinase, a screening of the gene
library of the DNA of A.radiobacter digested with SacI was carried out by hybridization with an
oligonucleotide synthesized on the basis of the nucleotide sequence of the incomplete ORF.

The screening led to the isolation of a clone containing the whole hydantoinase gene. The genes thus
isolated were sequenced using the Sequenase version 2.0 kit sold by United States Biochemical.

For the construction of a plasmid comprising both of the isolated genes, vectors selected from the plasmids,
cosmids and bacteriophages known in the art can be used.

The bifunctional E.coli and B.subtilis plasmid pSM671 (CBS 205.94) is preferably used.

This plasmid, which comprises the genes which encode resistance to kanamycin and chloramphenicol
and has replication origins operable in E.coli and B.subtilis, is characterized in that it contains a
synthetic promoter capable of directing, with high efficiency and without inducers, the expression of the genes
put under its control.

In practice, the DNA fragments containing the genes which encode the D-hydantoinase and
D-N-carbamylase enzymes are cloned into the plasmid pSM671 in the unique restriction sites EcoRI and HindIII,
obtaining the recombinant plasmid pSM651.

The construction can be carried out operating according to the general techniques known in the field of
recombinant DNA. In order to verify whether these enzymes are expressed in B.subtilis and E.coli, cells
transformed with said plasmid are cultured in a suitable culture medium. The total proteins, extracted from
the cellular lysate and analyzed on polyacrylamide gel, showed the presence of two proteins having a molecular
weight corresponding to that of the two enzymes; these proteins represent about 10% of the total proteins.
These results confirm the capacity of B.subtilis and E.coli to express said enzymes with high efficiency and
without inducers.

The enzymatic system of the present invention can be obtained by culturing the strains E.coli or
B.subtilis transformed with the plasmid pSM651, under aerobic conditions, in an aqueous medium
containing assimilable sources of carbon and nitrogen as well as various cations, anions and, possibly, traces of
vitamins, such as biotin or thiamine, or amino acids.

Assimilable carbon sources comprise carbohydrates such as glucose, hydrolysed starch, molasses,
sucrose or other conventional carbon sources.

Nitrogen sources can be selected, for example, from mineral ammonium salts, such as
ammonium nitrate, ammonium sulphate, ammonium chloride or ammonium carbonate, and urea or materials
containing organic or inorganic nitrogen such as peptone, yeast extract or meat extract.

The following cations and anions are equally suitable for the purposes of the present invention: potassium,
sodium, magnesium, iron, calcium, manganese, acid phosphates, sulphates, chlorides and nitrates.

The fermentation is carried out, under stirring, at a temperature of between 25° and 40°C, preferably
between 30° and 37°C, and at a pH of between 6 and 7.5, preferably between 6.5 and 7.0.

The cells (biomass) recovered from the culture medium by means of the conventional techniques such 
as centrifugation or filtration are used in the conversion phase of the racemic mixtures of 5-substituted 
hydantoins. 

Alternatively, the conversion reaction can be carried out using either the cellular extract obtained from
the disintegration of the cells by sonication or French press, or enzymes purified or partially purified with
the conventional methods, or enzymes immobilized on insoluble supports.

Numerous hydantoins substituted in position 5 can be used in the process of the present invention.
Possible substituents in position 5 are selected from a linear or branched alkyl group with a number of
carbon atoms of between 1 and 6, which can be mono- or polysubstituted with hydroxy, carboxy,
hydrosulphide or amino groups, or a phenyl or benzyl group which, in turn, can contain one or more
substituents in ortho, meta and para position. Examples of 5-substituted hydantoins are:
D,L-5-phenylhydantoin, D,L-5-para-hydroxyphenylhydantoin, D,L-5-methylhydantoin, D,L-5-isopropylhydantoin,
D,L-5-thienylhydantoin, D,L-5-para-methoxyphenylhydantoin, D,L-5-para-chlorophenylhydantoin,
D,L-5-benzylhydantoin.

The conversion of the hydantoins into the corresponding D-α-amino acids is carried out in a nitrogen
atmosphere in a hermetically closed apparatus, at a temperature of between 20 and 60°C, preferably
between 30 and 45°C.

The pH of the reaction medium is maintained within values of between 6 and 10 and preferably
between 7 and 8.5. This regulation of the pH can be carried out, for example, by adding an aqueous
solution of a base such as ammonia, potassium hydroxide, sodium hydroxide, or sodium or
potassium carbonate.

The initial concentration of the hydantoins is generally between 2% and 30% by weight. As a result of
the stereospecificity of the enzymes produced from the strains of the present invention, only the
D-enantiomorphs of the hydantoins are hydrolyzed. As hydantoins, however, spontaneously racemize more or
less rapidly under the operating conditions, the L-enantiomorphs are also completely converted into the
corresponding D-α-amino acids.
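
Why racemization lifts the theoretical yield above 50% can be illustrated with a toy kinetic model. The sketch below is only an illustration: it assumes first-order kinetics, lumps hydantoinase and carbamylase into a single D-specific step, and uses hypothetical rate constants that do not come from the text.

```python
# Toy model of the racemic conversion: only the D-hydantoin is hydrolyzed,
# while L- and D-hydantoin interconvert by spontaneous racemization.
# ASSUMPTIONS: first-order kinetics, hypothetical rate constants (per hour),
# both enzymatic steps lumped into one D-specific rate k_hyd.

def simulate_yield(k_hyd, k_rac, hours, dt=0.01):
    """Return the fraction of the starting hydantoin converted to D-amino acid."""
    d_hyd, l_hyd, product = 0.5, 0.5, 0.0  # racemic starting mixture
    for _ in range(round(hours / dt)):
        hydrolysed = k_hyd * d_hyd * dt          # D-specific enzymatic step
        net_l_to_d = k_rac * (l_hyd - d_hyd) * dt  # racemization flux
        d_hyd += net_l_to_d - hydrolysed
        l_hyd -= net_l_to_d
        product += hydrolysed
    return product


print(f"without racemization: {simulate_yield(1.0, 0.0, 50):.3f}")
print(f"with racemization:    {simulate_yield(1.0, 0.5, 50):.3f}")
```

Without racemization the model cannot exceed half of the starting material, whereas any non-zero racemization rate lets the L-enantiomorph drain through the D-specific step.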

The quantity of biomass which is added to the reaction mixture depends on the particular affinity of the
substrate towards the enzymes. Generally a ratio by weight biomass/hydantoins of between 1/1 and 1/50
can be used.

When the conversion reaction is carried out under optimum conditions a yield of 95-98% is obtained.
The D-α-amino acids prepared with the process of the present invention can be recovered from the
reaction medium with the conventional methods such as ion-exchange chromatography or precipitation of
the amino acid at its isoelectric point.

The plasmid pSM651 was deposited at the Centraalbureau voor Schimmelcultures, SK Baarn (Holland) as E.coli
SMC305, where it received the deposit number CBS 203.94.

The following experimental examples provide a better illustration of the present invention but do not
limit it in any way.

Example 1 

Extraction of the chromosomal DNA from A.radiobacter 

100 ml of fermentation medium having the following composition:
1% glucose, 0.3% yeast extract, 1.36% KH2PO4, 0.02% MgSO4.7H2O (pH 7.0) were inoculated with the
strain A.radiobacter (NRRL B-11291) and maintained under stirring (220 rpm) at 30°C for 24 hours.

The cells were then recovered by centrifugation of the culture broth in an SS34 rotor, model Sorvall
RC-5B (at 4°C and 5000 rpm for 10 minutes) and then washed (2x120 ml) with a solution (TE) containing 1 mM
EDTA, 10 mM Tris-HCl, pH 7.4. The resulting suspension was centrifuged again as above and the cells
were recovered and resuspended in 9.5 ml of TE solution. After adding 0.5 ml of 10% SDS (sodium
dodecylsulphate) and 50 µl of a solution of Proteinase K (20 mg/ml), the suspension was incubated at 37°C
for 1 hour.

18 ml of 5 M NaCl and 15 ml of a solution consisting of 10% hexadecyltrimethyl ammonium bromide
(CTAB) in 0.7 M NaCl were subsequently added and the resulting solution was incubated at 65°C for 20
minutes. The solution was then deproteinized with an equal volume of chloroform/isoamyl alcohol (24/1, v/v)
and the DNA was precipitated with 0.6 volumes of isopropanol. The DNA was washed with 1 ml of ethanol
(70%) and recovered with a glass rod. The recovered DNA was finally dissolved in 4 ml of TE and its
concentration was determined by spectrophotometry at 260 nm.
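
The spectrophotometric step can be illustrated with the usual conversion for double-stranded DNA. The sketch below assumes the standard factor of 50 µg/ml per absorbance unit at 260 nm (1 cm path length), which the text does not state; the absorbance reading and dilution factor are hypothetical, and only the 4 ml TE volume comes from the example.

```python
# Convert an A260 reading into a DNA concentration and a total yield.
# ASSUMPTION: 1.0 A260 unit corresponds to 50 ug/ml of double-stranded DNA
# (1 cm path length); the reading and dilution below are hypothetical.
DSDNA_UG_PER_ML_PER_A260 = 50.0


def dsdna_concentration_ug_per_ml(a260, dilution_factor=1.0):
    """Concentration of the undiluted sample in ug/ml."""
    return a260 * DSDNA_UG_PER_ML_PER_A260 * dilution_factor


def total_yield_ug(a260, dilution_factor, volume_ml):
    """Total DNA in the sample, given the volume it was dissolved in."""
    return dsdna_concentration_ug_per_ml(a260, dilution_factor) * volume_ml


reading, dilution = 0.5, 20  # hypothetical measurement on a 1:20 dilution
print(dsdna_concentration_ug_per_ml(reading, dilution), "ug/ml")
print(total_yield_ug(reading, dilution, volume_ml=4.0), "ug in 4 ml of TE")
```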

The chromosomal DNA was purified again by centrifugation on a gradient of CsCl (1%) containing 0.1
mg/ml of ethidium bromide (55,000 rpm for 16 hours in a Beckman rotor V65Ti).

The DNA band was visualized under UV light and the ethidium bromide was removed by extraction
with butanol saturated with H2O. After dialysis against TE buffer, the DNA was precipitated with ethanol and
resuspended in the desired volume.

Example 2

Construction of a genomic bank of A.radiobacter 

Aliquots (10 µg) of the DNA thus obtained were digested, separately, with 25 units of each of the
restriction enzymes EcoRI, PstI, BamHI, SacI and SphI (Boehringer), operating according to the instructions
of the producer.

After blocking the enzymatic reactions at 65°C for 10 minutes, the reaction mixtures were loaded onto a
0.8% agarose gel and run at 100 volts for 2 hours. The DNA bands, visualized by staining with EtBr (0.5
µg/ml), were then transferred onto a nylon filter (Boehringer) and, after treatment with NaOH, the DNA was
immobilized according to the Southern blot technique (Maniatis et al., "Molecular Cloning: a
laboratory manual", Cold Spring Harbor, New York, 1982).

The filter was hybridized at 45°C with each of the degenerate oligonucleotides, designed on the
basis of the amino-terminal sequence of the carbamylase of Comamonas sp. E222c (Ogawa et al., (1993), Eur. J.
Biochem., 212: 685-691), having the sequence:

1) 5' (C/A)G(A/G/C/T) AT(T/C/A) GT(A/G/C/T) AA(T/C) TA(T/C) GC(A/G/C/T) GC(A/G/C/T) GC 3'

2) 5' GG(A/G/C/T) CC(A/G/C/T) AT(T/C/A) CA(A/G) CG(A/G/C/T) GC 3'

3) 5' (C/A)G(A/G/C/T) GC(A/G/C/T) GA(T/C) GT(A/G/C/T) ATG GA 3'

These oligonucleotides were synthesized using the OLIGO 1000 automatic system (Beckman)
and then labelled at the 5' end using the DIG system kit (Boehringer). The hybridization reaction with
probe 2 gave positive signals. In particular, the DNA digested with BamHI generated a fragment of about
4000 bp capable of hybridizing with the probe.
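
How many distinct sequences each degenerate probe pool contains can be tallied with a short sketch. It assumes that the alternative bases printed under each probe translate into the IUPAC ambiguity codes used below (M = A/C, R = A/G, Y = C/T, H = A/C/T, N = any base); the probe strings written in IUPAC form and the function names are illustrative renderings, not text from the source.

```python
from itertools import product

# IUPAC ambiguity codes needed for the three probes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "H": "ACT", "N": "ACGT",
}

# ASSUMPTION: IUPAC rendering of the three degenerate probes of this example.
PROBES = {
    1: "MGNATHGTNAAYTAYGCNGCNGC",
    2: "GGNCCNATHCARCGNGC",
    3: "MGNGCNGAYGTNATGGA",
}


def degeneracy(oligo):
    """Number of distinct sequences represented by a degenerate oligo."""
    count = 1
    for base in oligo:
        count *= len(IUPAC[base])
    return count


def expand(oligo):
    """Yield every concrete sequence in the degenerate pool."""
    return ("".join(bases) for bases in product(*(IUPAC[b] for b in oligo)))


for number, oligo in PROBES.items():
    print(f"probe {number}: {len(oligo)} nt, {degeneracy(oligo)}-fold degenerate")
```

Under this reading, probe 1 is by far the most degenerate pool, while probes 2 and 3 each represent a few hundred sequences.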

To isolate the BamHI fragment thus identified, 10 µg of chromosomal DNA were suspended in 50 µl of
buffer (10 mM Tris-HCl pH 8, 100 mM NaCl, 5 mM MgCl2, 1 mM 2-mercaptoethanol) and incubated at 37°C
for 4-5 hours in the presence of 25 U of the enzyme BamHI.

The digestion mixture was then subjected to electrophoresis on a 0.8% agarose gel and, after staining
with EtBr, DNA fragments of 3,500 - 4,500 bp were electroeluted in the electrophoresis buffer (Maniatis et
al., "Molecular Cloning: a laboratory manual", Cold Spring Harbor, New York, 1982).

The chromosomal DNA fragments were then cloned into the plasmid pUC18 (BRL). In practice, 20 ng of
this plasmid, previously linearized with the restriction enzyme BamHI, were ligated with 100 ng of the
chromosomal DNA fragments in 20 µl of mixture containing 66 mM Tris-HCl pH 7.6, 1 mM ATP, 10 mM
MgCl2, 10 mM dithiothreitol (DTT), in the presence of 1 U of T4 DNA ligase, at 16°C overnight.
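
The insert-to-vector molar ratio implied by these quantities can be estimated as follows. This is only a sketch: it assumes a pUC18 length of 2,686 bp and an average insert length of 4,000 bp (the middle of the eluted size range), together with the usual approximation of 650 g/mol per base pair; none of these figures is stated in the example.

```python
# Estimate the insert:vector molar ratio of the ligation.
# ASSUMPTIONS: pUC18 = 2686 bp, mean insert = 4000 bp, 650 g/mol per bp.
GRAMS_PER_MOL_PER_BP = 650.0


def picomoles(mass_ng, length_bp):
    """Picomoles of double-stranded DNA molecules of the given length."""
    return mass_ng * 1000.0 / (length_bp * GRAMS_PER_MOL_PER_BP)


def insert_to_vector_ratio(insert_ng, insert_bp, vector_ng, vector_bp):
    return picomoles(insert_ng, insert_bp) / picomoles(vector_ng, vector_bp)


ratio = insert_to_vector_ratio(insert_ng=100, insert_bp=4000,
                               vector_ng=20, vector_bp=2686)
print(f"insert:vector molar ratio of about {ratio:.1f}:1")
```

With these assumptions the ligation runs at roughly a threefold molar excess of insert over vector.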

The ligase mixture was used for transforming cells of E.coli JM101 (BRL) made competent with 50 mM
CaCl2 (Dagert, M. and Ehrlich (1979), Gene, 6:23).

The transformants were subsequently selected on plates of LB medium (8 g/l Bactotryptone (DIFCO), 5
g/l NaCl, 15 g/l Agar (DIFCO), 0.5 g/l yeast extract) to which 40 µg/ml of X-Gal
(5-bromo-4-chloro-3-indolyl-D-thio-galactopyranoside) and 100 µg/ml of ampicillin had been added.

Operating as described above numerous positive recombinant colonies (white) were obtained which 
were easily distinguishable from those not recombinant (blue). 

The positive clones were transferred onto nylon filters (Boehringer) and the DNA extracted from these
clones was hybridized under the same conditions using probe 2, which had responded positively to
hybridization with the chromosomal DNA.

The plasmids extracted from the clones which gave a positive signal were sequenced using the
Sequenase version 2.0 kit (United States Biochemical). One of these plasmids, containing the complete
carbamylase gene (915 bp), was called pSM652.

Figure 4 shows the nucleotide and amino acid sequence of carbamylase.

Example 3 

Isolation of the hydantoinase gene of A.radiobacter 

Analysis of the plasmid pSM652 showed the presence of a second incomplete ORF, situated on the
other strand with respect to the carbamylase gene, which showed a homology with urease portions isolated
from various microorganisms.

As ureases, like hydantoinases, are enzymes belonging to the group of amidohydrolases, it was
assumed that the incomplete ORF corresponded to that of hydantoinase. The assumption was then
confirmed by enzymatic activity tests carried out on cellular extracts of cells carrying the identified gene.

In order to isolate the whole nucleotide sequence encoding the hydantoinase, the same Southern blot
used for isolating the carbamylase was hybridized using as probe the oligonucleotide having the sequence:
5' ATC GTA ACC GCG GAG GGG ATT TCT CCC 3'

This oligonucleotide, homologous to the 5' end region of the nucleotide sequence of the identified partial
ORF, was synthesized and labelled as shown in example 2. Among the bands positive for this probe, a band
of about 3500 bp obtained from the digestion of the DNA with the enzyme SacI was identified.

Operating as shown in example 2, a genomic bank of chromosomal DNA of A.radiobacter digested with
SacI was then constructed. Screening of this bank led to the isolation of the plasmid pSM653 containing the
whole gene for hydantoinase, whose nucleotide and amino acid sequence is shown in figure 5.

Example 4 

Cloning of the carbamylase gene 

1) Amplification of the carbamylase gene

The carbamylase gene contained in the plasmid pSM652 was amplified by the Polymerase Chain Reaction
(PCR) technique, according to the indications specified by Leung et al. (Leung D.W., Chen E., Goeddel D.V.,
Technique - a journal of methods in cell and molecular biology, 1, No. 1 (1989): pages 11-15), using the
pair of oligonucleotides:

(1) 5' GGG AAT TCT TAT GAC ACG TCA G 3' (FORWARD, containing an EcoRI site)

(2) 5' CCC AAG CTT CAA AAT TCC GCG AT 3' (REVERSE, containing a HindIII site)

The oligonucleotide (2) also allowed the EcoRI restriction site present inside the carbamylase gene, near
the 3' end, to be eliminated.
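
The restriction sites carried by the two primers can be verified with a small sketch. The recognition sequences GAATTC (EcoRI) and AAGCTT (HindIII) are the standard ones; the helper names are illustrative, and the primer strings are the sequences given above with spaces removed.

```python
# Check which restriction sites are present in the two PCR primers.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

ECORI_SITE = "GAATTC"
HINDIII_SITE = "AAGCTT"

FORWARD = "GGGAATTCTTATGACACGTCAG"
REVERSE = "CCCAAGCTTCAAAATTCCGCGAT"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def has_site(seq, site):
    """True if the site occurs on either strand of the sequence."""
    return site in seq or site in reverse_complement(seq)


print("forward primer has EcoRI:  ", has_site(FORWARD, ECORI_SITE))
print("reverse primer has HindIII:", has_site(REVERSE, HINDIII_SITE))
print("reverse primer has EcoRI:  ", has_site(REVERSE, ECORI_SITE))
```

The reverse primer carries HindIII but no intact EcoRI site on either strand, which is consistent with its stated role of removing the internal EcoRI site from the amplified gene.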

The amplification was carried out in a DNA Thermal Cycler 480 apparatus (Perkin-Elmer Cetus) using
a reaction mixture (100 µl) containing 10 mM Tris-HCl pH 8.3, 1.5 mM MgCl2, 50 mM KCl, 0.01%
(weight/volume) of gelatine, 1 ng of pSM652, 1 µM of the two primers, 200 µM of dNTP, 0.5 Units of Taq
polymerase (Perkin Elmer).

A drop of mineral oil was added, the mixture was denatured for 4 minutes at 94°C and the cyclic
program was then started, comprising:
1 minute at 94°C (denaturation)
1 minute at 55°C (annealing)
2 minutes at 72°C (elongation)
for a total of 30 cycles, followed by 8 minutes at 72°C (final extension).
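
The programmed hold time of this thermal profile can be added up with a few lines. The sketch below counts only the hold times listed above and ignores the temperature ramps between steps, which depend on the instrument; the variable and function names are illustrative.

```python
# Sum the programmed hold times of the PCR profile (ramp times are ignored).
INITIAL_DENATURATION_MIN = 4
CYCLE_STEPS_MIN = {
    "denaturation, 94 C": 1,
    "annealing, 55 C": 1,
    "elongation, 72 C": 2,
}
CYCLES = 30
FINAL_EXTENSION_MIN = 8


def total_hold_time_min():
    """Total programmed time in minutes, excluding temperature ramps."""
    per_cycle = sum(CYCLE_STEPS_MIN.values())
    return INITIAL_DENATURATION_MIN + CYCLES * per_cycle + FINAL_EXTENSION_MIN


minutes = total_hold_time_min()
print(f"{minutes} min, i.e. {minutes // 60} h {minutes % 60} min")
```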

The amplification product thus obtained was treated with phenol-chloroform (1:1), precipitated with NaCl
(1/10 vol/vol) and EtOH (2 vol) and resuspended in 20 µl of H2O. After cutting with the restriction enzymes
EcoRI and HindIII (5 U), suitable for cloning into the plasmid pSM671 (CBS 205.94), the DNA fragments were
purified on 1.0% low-melting gel (SeaPlaque, FMC BioProducts) and the bands excised from the gel were
treated with GELase (Epicentre Technologies) (1 U per 300 µg of gel weighed) for 1.5 hours at 45°C.
At the same time, 50 ng of the plasmid pSM671 were cut with the same restriction enzymes.
The plasmid and fragments were ligated in 10 µl of reaction mixture (DNA 20 ng/ml) and 2 µl of this
mixture were used for transforming cells of E.coli 71/18 made competent with CaCl2 (Dagert and Ehrlich,
Gene, 6: 23, 1979). The transformants were selected on plates of LB medium containing 20 µg/ml of
chloramphenicol.

The plasmid DNA extracted from the positive clones was analyzed to verify the correct insertion of the
carbamylase gene and the absence of possible errors caused by the amplification.
One of these plasmids was called pSM637.

The strain of E.coli containing the plasmid pSM637 was called SMC307.

Cells of B.subtilis SMS108 NRRL B-15898, made competent as described in "Molecular Biology
Methods for Bacillus", (1990) (Harwood and Cutting (eds), Wiley and Sons), were transformed with 100 ng of
the plasmid pSM637 operating according to the known techniques, and the transformed strain was called
SMS374.

Example 5

Expression of the carbamylase gene in E.coli and B.subtilis 

The object of the experiment was to verify the ability of the transformed strains (E.coli SMC307 and
B.subtilis SMS374) to express the carbamylase gene without inducers.

Precultures on slants of the strains E.coli SMC307 and B.subtilis SMS374 were inoculated into two 100
ml flasks containing, respectively, 10 ml of LB medium to which 20 µg/ml of chloramphenicol had been
added and 10 ml of VY medium to which 5 µg/ml of chloramphenicol had been added. The flasks were
incubated, under stirring (220 rpm), at 37°C for 16 hours.

The cells were recovered by centrifugation (12,000 rpm, 4°C, for 1 minute) of the two culture broths,
resuspended in 300 µl of buffer (20 mM Tris-HCl pH 7.5, 20 mM β-mercaptoethanol, 20% glycerol) and lysed by
sonication (Soniprep 150, MSE; 1-minute pulses, at medium voltage). Aliquots (15 µl) of the two lysates
were loaded onto a 10% polyacrylamide gel and run at 20 mA for three hours. The protein bands were
visualized by staining with Coomassie R-250 (Laemmli, Nature: 227, 680, 1970). The staining revealed
a protein band with a molecular weight of 34,000 D, absent in the extracts of the
untransformed strains B.subtilis SMS108 and E.coli 71/18. In addition, densitometric analysis carried out on
the same Coomassie-stained gel showed that this protein was expressed in both of the transformed
strains as one of the prevalent proteins (10% with respect to the total proteins).

Example 6

Cloning of the hydantoinase gene 

The plasmid pSM653 (1 µg) was digested with the restriction enzymes EcoRV and SalI (4 U)
(Boehringer) at 37°C for 1 hour.

The digestion mixture was then subjected to electrophoresis on a 0.8% low-melting agarose gel and,
after staining with EtBr, the DNA band corresponding to an EcoRV-SalI fragment of 1300 bp was excised and
the DNA extracted with the GELase method (Epicentre Technologies). As this fragment has a small
region missing at the 5' end and a portion of 70 bp missing at the 3' end, the whole hydantoinase gene was
reconstructed using two linkers having the sequence:



LINKER 5' (EcoRI site at the 5' end):

5' AATTC TTATG GAT 3'

LINKER 3' (SalI site at the 5' end, EcoRI site at the 3' end):

5' TCGA CGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGG 3'

40 ng of the 1300 bp fragment, 40 ng of the linker 3', 10 ng of the linker 5' and 50 ng of the plasmid
pSM671 (CBS 205.94), previously linearized with EcoRI, were then ligated in a ligase mixture containing 1 U
of T4 DNA ligase, incubating at 12°C for 16 hours. The ligase mixture was subsequently used to transform
competent cells of E.coli 71/18 and the transformants were selected on plates of LB medium to which 20
µg/ml of chloramphenicol had been added.

The plasmid DNAs isolated from some of the positive clones were analyzed to identify the clones
containing the complete sequence of the hydantoinase gene.

One of these plasmids was called pSM650 and the strain of E.coli containing said plasmid was designated
SMC308.

100 ng of the plasmid pSM650 were used to transform competent cells of B.subtilis SMS108. The resulting
strain was called SMS375.

Example 7

Expression of the hydantoinase gene in E.coli and B.subtilis 

The object of the experiment was to verify the capacity of the transformed strains (E.coli SMS308 and 
B.subtilis SMS375) to express the hydantoinase gene without inducers. 

A preculture on slant of each of the strains E.coli SMS308 and B.subtilis SMS375 was inoculated into two 50 ml 
flasks containing, respectively, 10 ml of LB medium to which 5 µg/ml of chloramphenicol had been added 
and 10 ml of VY medium to which 20 µg/ml of chloramphenicol had been added. The flasks were 
incubated, under gentle stirring (220 rpm), at 37 °C for 16 hours. 

The cells were recovered by centrifugation (12,000 rpm, 4 °C, for 1 minute) of the two culture broths, 
resuspended in 300 µl of buffer (20 mM Tris-HCl pH 7.5, 20 mM β-mercaptoethanol, 20% glycerol) and lysed by 
sonication (1 minute pulses, at medium voltage). Aliquots (15 µl) of the two lysates were loaded onto 
a 10% polyacrylamide gel and run at 20 mA for three hours. The protein bands were visualized by 
staining with Coomassie R-250 (Laemmli, Nature, 227: 680, 1970). After staining with Coomassie, a 
protein band was revealed with a molecular weight of 50,000 Daltons, absent in the extracts of the untransformed 
strains B.subtilis SMS108 and E.coli 71/18. In addition, densitometric analysis carried out on the same 
Coomassie-stained gel showed that this protein was expressed in the two transformed strains as one 
of the prevalent proteins (10% with respect to the total proteins). 
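The molecular weight of the band can be compared with the length of the reading frame in the sequence listing. The sketch below is an illustrative cross-check only: it assumes an average residue mass of about 110 Da, which is a generic rule of thumb and not a value taken from this document, and uses the residue counts that follow from SEQ ID NO: 19 (457 residues) and SEQ ID NO: 18 (304 residues). The function name `estimate_mass_da` is hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue (assumption, in Daltons).
AVG_RESIDUE_MASS_DA = 110.0

def estimate_mass_da(residues):
    """Rough protein mass estimate from the number of residues."""
    return residues * AVG_RESIDUE_MASS_DA

# Residue counts read from the sequence listing (stop codon excluded).
HYDANTOINASE_RESIDUES = 457   # SEQ ID NO: 19
CARBAMYLASE_RESIDUES = 304    # SEQ ID NO: 18

if __name__ == "__main__":
    print("hydantoinase ~", estimate_mass_da(HYDANTOINASE_RESIDUES), "Da")
    print("carbamylase  ~", estimate_mass_da(CARBAMYLASE_RESIDUES), "Da")
```

With this approximation the hydantoinase estimate falls close to the 50,000 Dalton band described above; a precise value would require the actual residue masses.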

Example 8 

Cloning of the hydantoinase-carbamylase operon 

The plasmid pSM650 (1 µg) was digested with the enzyme EcoRI (5 U) at 37 °C for 1 hour. The EcoRI-EcoRI 
fragment of about 1380 bp containing the hydantoinase gene was purified on a 0.8% agarose gel with 
the Gelase™ method. 20 ng of this fragment were ligated with 50 ng of the plasmid pSM637 linearized 
with EcoRI. The reaction was carried out in a ligase buffer containing 1 U of T4 DNA ligase, at 16 °C for 16 
hours. 

The ligase mixture was used to transform competent cells of E.coli 71/18. 

The transformants were subsequently selected on plates of LB medium (8 g/l Bactotryptone (DIFCO), 
5 g/l NaCl, 15 g/l Agar (DIFCO), 0.5 g/l yeast extract) to which 20 µg/ml of chloramphenicol had been 
added. 

The positive clones were analyzed by restriction analysis to verify the correct insertion of the two 
genes. The plasmid containing the hydantoinase-carbamylase operon was called pSM651 and the strain of 
E.coli containing said plasmid was marked with the abbreviation SMC305. 

Competent cells of B.subtilis SMS108 were transformed with 100 ng of this plasmid. One of the positive 
clones was called SMS373. 

Example 9 

Expression of the hydantoinase-carbamylase operon 

E.coli SMS305 and B.subtilis SMS373 were cultured, respectively, in 100 ml of LB medium to which 20 
µg of chloramphenicol had been added and in 100 ml of VY medium to which 5 µg of chloramphenicol had 
been added, at 37 °C for 16 hours, under stirring (200 rpm). The protein extracts obtained from the cell 
lysates were analyzed as described in example 7. The results showed the presence of two proteins 
corresponding to hydantoinase and carbamylase (figure 6). To evaluate the activity of these enzymes, a 
kinetic study of the reaction was carried out using 20 mM (D,L) parahydroxyphenyl-hydantoin or, alternatively, 
5-phenyl-hydantoin as substrate (in 200 mM phosphate buffer pH 8) and following the conversion into the 
corresponding D-α-amino acid through the evolution of ammonia. The method adopted is described by 
Weatherburn, M.W., (1967), (Anal. Chem., 39:971). 

Example 10 


Conversion of D,L-5-phenylhydantoin to D-phenylglycine 

A suspension of 2 g of D,L-5-phenyl-hydantoin in 100 ml of 0.2 M Na-phosphate buffer pH 8.0 was 
charged into an apparatus equipped with a stirrer and thermostat-regulated at 40 °C. After degassing with 
nitrogen at 40 °C for 5 minutes, 5 g (wet weight) of biomass was introduced, coming from a culture of 
E.coli SMS305, carried out as described in example 9. 

After the apparatus had been hermetically closed, the reaction mixture was maintained under a nitrogen 
atmosphere, at 40 °C for 24 hours. Polarimetric and thin layer chromatographic analysis (J. of Chromatography, 
80: 199-204, 1973) of an aliquot of the reaction mixture showed the complete hydrolysis of the 
starting substrate to D-phenylglycine. 

After separation of the biomass by centrifugation of the reaction mixture at 6000 rpm for 10 minutes, the 
supernatant was acidified to pH 1.0 with 6 M HCl and loaded onto a column (2.6 x 20 cm) of Amberlite IR 
120 (activated with HCl). The column was then washed with water and eluted with a 5% ammonia solution 
in water. The eluate was decolourized with decolourizing carbon (C.Erba), and the decolourized solution was 
concentrated under vacuum and brought to pH 5.8. The crystals thus obtained were recovered by filtration 
and recrystallized from water. The white powder obtained (1.63 g) showed a specific rotation [α]D20 = 
-156° (c = 1, 1 N HCl). The IR spectrum was in agreement with that of the standard D-phenylglycine. 
Example 11 

Conversion of D,L-5-phenylhydantoin to D-phenylglycine 

The same procedure was carried out as in example 10, using 5 g (wet weight) of biomass coming 
from the culture of E.coli SMS305 and 10 g of D,L-5-phenylhydantoin in 100 ml of 0.2 M Na-phosphate 
buffer pH 8.0. The reaction was carried out under a nitrogen atmosphere, at 40 °C for 90 hours. The white 
powder obtained (8.1 g) showed a specific rotation [α]D20 = -156.5° (c = 1, 1 N HCl). The IR spectrum 
agreed with that of the standard D-phenylglycine. 






Example 12 

Conversion of D,L-5-para-hydroxy-phenylhydantoin to D-para-hydroxy-phenylglycine 

The same procedure was carried out as in example 10, using 2.5 g (wet weight) of biomass and 1 g 
of D,L-5-para-hydroxy-phenylhydantoin. The D-para-hydroxy-phenylglycine obtained as a white powder (0.82 
g) showed a specific rotation [α]D20 = -158° (c = 1, 1 N HCl). The IR spectrum was in agreement with that 
of the standard D-phenylglycine. 

Example 13 

Conversion of D,L-5-para-hydroxy-phenylhydantoin to D-para-hydroxy-phenylglycine 



The same procedure was carried out as in example 10, using 2.5 g of biomass (wet weight) obtained 
from the culture of E.coli SMS305 and 8 g of D,L-5-parahydroxy-phenylhydantoin. 

The reaction was carried out under a nitrogen atmosphere, at 40 °C for 170 hours. The D-parahydroxy-phenylglycine 
obtained as a white powder (6.6 g) showed a specific rotation [α]D20 = -157.8° (c = 1, 1 N 
HCl). The IR spectrum was in agreement with that of the standard D-phenylglycine. 

Example 14 

Conversion of D,L-5-isopropylhydantoin to D-valine 

The same procedure was carried out as in example 10, using 5.0 g of biomass (wet weight) obtained 
from the culture of E.coli SMS305 and 2 g of D,L-5-isopropylhydantoin. 

The reaction was carried out under a nitrogen atmosphere, at 40 °C for 240 hours. The D-valine 
obtained as a white powder (0.8 g) showed a specific rotation [α]D20 = -27.5° (c = 5, 6 N HCl). The IR 
spectrum agreed with that of the standard D-valine. 
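The isolated amounts reported in examples 10 to 14 can be expressed as molar yields. The following is a minimal sketch under stated assumptions: the molecular formulas and standard atomic masses are general chemical data rather than values given in this document, the theoretical amount assumes that the whole racemic substrate is converted into the D-amino acid, and the names `molar_mass`, `isolated_yield` and `EXAMPLES` are illustrative.

```python
# Standard atomic masses (g/mol); general reference values, not from this document.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

def molar_mass(formula):
    """Molar mass from a formula given as {element: count}."""
    return sum(ATOMIC_MASS[el] * n for el, n in formula.items())

# Molecular formulas (assumed from general chemistry).
PHENYLHYDANTOIN = {"C": 9, "H": 8, "N": 2, "O": 2}
PHENYLGLYCINE = {"C": 8, "H": 9, "N": 1, "O": 2}
HYDROXYPHENYLHYDANTOIN = {"C": 9, "H": 8, "N": 2, "O": 3}
HYDROXYPHENYLGLYCINE = {"C": 8, "H": 9, "N": 1, "O": 3}
ISOPROPYLHYDANTOIN = {"C": 6, "H": 10, "N": 2, "O": 2}
VALINE = {"C": 5, "H": 11, "N": 1, "O": 2}

def isolated_yield(substrate_g, product_g, substrate, product):
    """Fraction of the theoretical product mass that was isolated."""
    theoretical_g = substrate_g / molar_mass(substrate) * molar_mass(product)
    return product_g / theoretical_g

# example number: (substrate g, product g, substrate formula, product formula)
EXAMPLES = {
    10: (2.0, 1.63, PHENYLHYDANTOIN, PHENYLGLYCINE),
    11: (10.0, 8.1, PHENYLHYDANTOIN, PHENYLGLYCINE),
    12: (1.0, 0.82, HYDROXYPHENYLHYDANTOIN, HYDROXYPHENYLGLYCINE),
    13: (8.0, 6.6, HYDROXYPHENYLHYDANTOIN, HYDROXYPHENYLGLYCINE),
    14: (2.0, 0.8, ISOPROPYLHYDANTOIN, VALINE),
}

YIELDS = {n: isolated_yield(*args) for n, args in EXAMPLES.items()}

if __name__ == "__main__":
    for n, y in YIELDS.items():
        print(f"example {n}: {100 * y:.1f}% of theoretical")
```

On these assumptions the aromatic substrates give isolated yields of roughly 94 to 95% of the theoretical amount, and the D-valine example roughly half of it.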

SEQUENCE LISTING 

NUMBER OF SEQUENCES: 19 

(1) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGAATTGTAA ATTATGCAGC AGC 23 

(1) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

AGGATCGTGA ACTACGCGGC GGC 23 

(1) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGCATAGTCA ATTATGCCGC CGC 23 

(1) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGTATTGTTA ATTATGCTGC TGC 23 

(1) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

GGACCAATTC AACGAGC 17 

(1) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

GGGCCGATCC AGCGGGC 17 

(1) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

GGCCCCATAC AACGCGC 17 

(1) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

GGTCCTATTC AACGTGC 17 

(1) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGAGCAGATG TAATGGA 17 

(1) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

AGGGCGGACG TGATGGA 17 

(1) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGCGCCGATG TCATGGA 17 

(1) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGTGCTGATG TTATGGA 17 
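SEQ ID NO: 1 to 12 form three groups of four oligonucleotides of equal length. The sketch below, which is an illustration and not a statement of how the oligonucleotides were designed, translates each one with the standard genetic code and groups them by the peptide they would encode in the first reading frame. The names `CODON_TABLE`, `translate` and `PROBES` are assumptions; a trailing incomplete codon is ignored.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate frame 1 of a DNA string, ignoring spaces and a partial last codon."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Oligonucleotides copied from SEQ ID NO: 1 to 12.
PROBES = {
    1: "CGAATTGTAA ATTATGCAGC AGC",
    2: "AGGATCGTGA ACTACGCGGC GGC",
    3: "CGCATAGTCA ATTATGCCGC CGC",
    4: "CGTATTGTTA ATTATGCTGC TGC",
    5: "GGACCAATTC AACGAGC",
    6: "GGGCCGATCC AGCGGGC",
    7: "GGCCCCATAC AACGCGC",
    8: "GGTCCTATTC AACGTGC",
    9: "CGAGCAGATG TAATGGA",
    10: "AGGGCGGACG TGATGGA",
    11: "CGCGCCGATG TCATGGA",
    12: "CGTGCTGATG TTATGGA",
}

PEPTIDES = {n: translate(seq) for n, seq in PROBES.items()}

if __name__ == "__main__":
    for n, pep in PEPTIDES.items():
        print(f"SEQ ID NO: {n:2d} -> {pep}")
```

Within each group the four oligonucleotides translate to the same short peptide while differing in codon choice, which is consistent with their being alternative codon versions of one peptide stretch.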
(1) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

ATCGTAACCG CGGACGGGAT TTCTCCC 27 

(1) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 
(A) NAME: Primer 
(xi) SEQUENCE DESCRIPTION: 

GGGAATTCTT ATGACACGTC AG 22 

(1) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 
(A) NAME: Primer 
(xi) SEQUENCE DESCRIPTION: 

CCCAAGCTTC AAAATTCCGC GAT 23 
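The two primers of SEQ ID NO: 14 and 15 can be compared with the ends of the coding sequence of SEQ ID NO: 18. The following is a hedged sketch: the gene-end strings are copied from the first and last codons of SEQ ID NO: 18, the restriction sites GAATTC (EcoRI) and AAGCTT (HindIII) are general knowledge rather than statements of this section, and the names `reverse_complement` and `check_primers` are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[b] for b in reversed(dna))

# Primers copied from SEQ ID NO: 14 and 15 (spaces removed).
FORWARD_PRIMER = "GGGAATTCTTATGACACGTCAG"
REVERSE_PRIMER = "CCCAAGCTTCAAAATTCCGCGAT"

# First four and last five codons of SEQ ID NO: 18.
GENE_START = "ATGACACGTCAG"
GENE_END = "ATCGCGGAATTTTGA"

def check_primers():
    """Simple consistency checks between the primers and the gene ends."""
    return {
        "forward_has_EcoRI_site": "GAATTC" in FORWARD_PRIMER,
        "forward_matches_gene_start": FORWARD_PRIMER.endswith(GENE_START),
        "reverse_has_HindIII_site": "AAGCTT" in REVERSE_PRIMER,
        "reverse_matches_gene_end": REVERSE_PRIMER.endswith(
            reverse_complement(GENE_END)
        ),
    }

if __name__ == "__main__":
    for name, ok in check_primers().items():
        print(name, ok)
```

Under these assumptions the forward primer ends with the first twelve nucleotides of the reading frame and the reverse primer ends with the reverse complement of its last fifteen nucleotides, stop codon included.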
(1) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 
(A) NAME: Linker 
(xi) SEQUENCE DESCRIPTION: 

AATTCTTATG GAT 13 

(1) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 
(A) NAME: Linker 
(xi) SEQUENCE DESCRIPTION: 

TCGACGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA 40 
GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGG 76 
(1) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 915 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Agrobacterium radiobacter 
(xi) SEQUENCE DESCRIPTION: 

ATG ACA CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC  45 
Met Thr Arg Gln Met Ile Leu Ala Val Gly Gln Gln Gly Pro Ile 
                5                   10                  15 

GCG CGA GCG GAG ACA CGC GAA CAG GTG GTT GGC CGC CTC CTC GAC  90 
Ala Arg Ala Glu Thr Arg Glu Gln Val Val Gly Arg Leu Leu Asp 
                20                  25                  30 

ATG TTG ACG AAC GCA GCC AGC CGG GGC GTG AAC TTC ATC GTC TTT  135 
Met Leu Thr Asn Ala Ala Ser Arg Gly Val Asn Phe Ile Val Phe 
                35                  40                  45 

CCC GAG CTT GCG CTC ACG ACC TTC TTC CCG CGC TGG CAT TTC ACC  180 
Pro Glu Leu Ala Leu Thr Thr Phe Phe Pro Arg Trp His Phe Thr 
                50                  55                  60 

GAC GAG GCC GAG CTC GAT AGC TTC TAT GAG ACC GAA ATG CCC GGC  225 
Asp Glu Ala Glu Leu Asp Ser Phe Tyr Glu Thr Glu Met Pro Gly 
                65                  70                  75 

CCG GTG GTC CGT CCA CTC TTT GAG ACG GCC GCC GAA CTC GGG ATC  270 
Pro Val Val Arg Pro Leu Phe Glu Thr Ala Ala Glu Leu Gly Ile 
                80                  85                  90 

GGC TTC AAT CTG GGC TAC GCC GAA CTC GTC GTC GAA GGC GGC GTC  315 
Gly Phe Asn Leu Gly Tyr Ala Glu Leu Val Val Glu Gly Gly Val 
                95                  100                 105 

AAG CGT CGC TTC AAC ACG TCC ATT CTG GTG GAT AAG TCA GGC AAG  360 
Lys Arg Arg Phe Asn Thr Ser Ile Leu Val Asp Lys Ser Gly Lys 
                110                 115                 120 

ATC GTC GGC AAG TAT CGT AAG ATC CAT TTG CCG GGT CAC AAG GAG  405 
Ile Val Gly Lys Tyr Arg Lys Ile His Leu Pro Gly His Lys Glu 
                125                 130                 135 

TAC GAG GCC TAC CGG CCG TTC CAG CAT CTT GAA AAG CGT TAT TTC  450 
Tyr Glu Ala Tyr Arg Pro Phe Gln His Leu Glu Lys Arg Tyr Phe 
                140                 145                 150 

GAG CCG GGC GAT CTC GGC TTC CCG GTC TAT GAC GTC GAC GCC GCG  495 
Glu Pro Gly Asp Leu Gly Phe Pro Val Tyr Asp Val Asp Ala Ala 
                155                 160                 165 

AAA ATG GGG ATG TTC ATC TGC AAC GAT CGC CGC TGG CCT GAA ACG  540 
Lys Met Gly Met Phe Ile Cys Asn Asp Arg Arg Trp Pro Glu Thr 
                170                 175                 180 

TGG CGG GTG ATG GGA CTT AAG GGC GCC GAG ATC ATC TGC GGC GGC  585 
Trp Arg Val Met Gly Leu Lys Gly Ala Glu Ile Ile Cys Gly Gly 
                185                 190                 195 

TAC AAC ACG CCG ACC CAC AAT CCC CCC GTT CCC CAG CAC GAC CAT  630 
Tyr Asn Thr Pro Thr His Asn Pro Pro Val Pro Gln His Asp His 
                200                 205                 210 

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GGG TCG TAC  675 
Leu Thr Ser Phe His His Leu Leu Ser Met Gln Ala Gly Ser Tyr 
                215                 220                 225 

CAA AAC GGC GCC TGG TCC GCG GCG GCC GGC AAG GTC GGC ATG GAG  720 
Gln Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu 
                230                 235                 240 

GAG GGG TGC ATG CTG CTC GGC CAT TCG TGC ATC GTG GCG CCG ACC  765 
Glu Gly Cys Met Leu Leu Gly His Ser Cys Ile Val Ala Pro Thr 
                245                 250                 255 

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTG GAA GAC GAG GTG ATC  810 
Gly Glu Ile Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val Ile 
                260                 265                 270 

ACC GCC GCC GTC GAT CTC GAC CGC TGC CGG GAA CTG CGC GAA CAC  855 
Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His 
                275                 280                 285 

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAC GGT CTG  900 
Ile Phe Asn Phe Lys Ala His Arg Gln Pro Gln His Tyr Gly Leu 
                290                 295                 300 

ATC GCG GAA TTT TGA  915 
Ile Ala Glu Phe STOP 
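Because each nucleotide row of the listing is printed above its three-letter translation, the two rows can be checked against each other. The sketch below is an illustrative proofreading aid under the assumption of the standard genetic code; the names `annotate` and `mismatches` are hypothetical and the rows used are taken from SEQ ID NO: 18.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter codes, with STOP for termination codons.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "STOP",
}

def annotate(dna_row):
    """Three-letter translation of a row of space-separated codons."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in dna_row.split()]

def mismatches(dna_row, aa_row):
    """List (position, printed, expected) where the printed row disagrees."""
    expected = annotate(dna_row)
    printed = aa_row.split()
    return [
        (i + 1, got, exp)
        for i, (got, exp) in enumerate(zip(printed, expected))
        if got != exp
    ]

if __name__ == "__main__":
    first_row = "ATG ACA CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC"
    print(" ".join(annotate(first_row)))
    print(mismatches("ATC GCG GAA TTT TGA", "Ile Ala Glu Phe STOP"))
```

A row whose printed translation contains a misspelt residue, for example `Hys` above a `CAC` codon, is reported with the expected `His`.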
(1) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1374 base pairs 
(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Agrobacterium radiobacter 
(xi) SEQUENCE DESCRIPTION: 

ATG GAT ATC ATC ATC AAG AAC GGA ACC ATC GTA ACC GCG GAC GGG  45 
Met Asp Ile Ile Ile Lys Asn Gly Thr Ile Val Thr Ala Asp Gly 
                5                   10                  15 

ATT TCT CCC GCC GAT CTC GGA ATC AAG GAT GGC AAG ATC GCC CAG  90 
Ile Ser Pro Ala Asp Leu Gly Ile Lys Asp Gly Lys Ile Ala Gln 
                20                  25                  30 

ATC GGC GGA ACG TTC GGC CCG GCC GGC CGG ACA ATC GAC GCC TCC  135 
Ile Gly Gly Thr Phe Gly Pro Ala Gly Arg Thr Ile Asp Ala Ser 
                35                  40                  45 

GGC CGC TAC GTT TTT CCG GGC GGC ATC GAC GTT CAT ACG CAT GTC  180 
Gly Arg Tyr Val Phe Pro Gly Gly Ile Asp Val His Thr His Val 
                50                  55                  60 

GAG ACG GTC AGC TTC AAC ACG CAG TCG GCC GAC ACA TTC GCA ACC  225 
Glu Thr Val Ser Phe Asn Thr Gln Ser Ala Asp Thr Phe Ala Thr 
                65                  70                  75 

GCG ACG GTC GCG GCC GCC TGT GGC GGC ACG ACG ACC ATC GTC GAT  270 
Ala Thr Val Ala Ala Ala Cys Gly Gly Thr Thr Thr Ile Val Asp 
                80                  85                  90 

TTC TGC CAG CAG GAC CGC GGC CAT AGC CTG AGG GAG GCG GTC GCC  315 
Phe Cys Gln Gln Asp Arg Gly His Ser Leu Arg Glu Ala Val Ala 
                95                  100                 105 

AAA TGG GAC GGC ATG GCC GGC GGC AAG TCG GCG ATC GAC TAC GGC  360 
Lys Trp Asp Gly Met Ala Gly Gly Lys Ser Ala Ile Asp Tyr Gly 
                110                 115                 120 

TAC CAT ATC ATC GTG CTC GAT CCG ACT GAT AGC GTG ATC GAG GAG  405 
Tyr His Ile Ile Val Leu Asp Pro Thr Asp Ser Val Ile Glu Glu 
                125                 130                 135 

CTA GAG GTA CTG CCA GAT CTC GGC ATC ACC TCC TTC AAG GTC TTC  450 
Leu Glu Val Leu Pro Asp Leu Gly Ile Thr Ser Phe Lys Val Phe 
                140                 145                 150 

ATG GCT TAT CGC GGC ATG AAC ATG ATC GAC GAC GTG ACA CTG CTC  495 
Met Ala Tyr Arg Gly Met Asn Met Ile Asp Asp Val Thr Leu Leu 
                155                 160                 165 

AGG ACG CTC GAC AAG GCC GCC AAG ACT GGG TCA CTC GTC ATG GTG  540 
Arg Thr Leu Asp Lys Ala Ala Lys Thr Gly Ser Leu Val Met Val 
                170                 175                 180 

CAC GCG GAG AAC GGC GAC GCC GCC GAC TAT CTT CGC GAC AAG TTC  585 
His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg Asp Lys Phe 
                185                 190                 195 

GTC GCC GAT GGC AAA ACG GCG CCG ATC TAC CAC GCG CTC AGC CGT  630 
Val Ala Asp Gly Lys Thr Ala Pro Ile Tyr His Ala Leu Ser Arg 
                200                 205                 210 

CCG CCC CGG GTC GAA GCC GAG GCG ACC GCG AGG GCC CTC GCC CTG  675 
Pro Pro Arg Val Glu Ala Glu Ala Thr Ala Arg Ala Leu Ala Leu 
                215                 220                 225 

GCG GAA ATC GTC AAC GCC CCG ATC TAC ATC GTG CAT CTG ACC TGC  720 
Ala Glu Ile Val Asn Ala Pro Ile Tyr Ile Val His Leu Thr Cys 
                230                 235                 240 

GAA GAA TCC TTC GAC GAG TTG ATG CGG GCA AAG GCT CGG GGT GTC  765 
Glu Glu Ser Phe Asp Glu Leu Met Arg Ala Lys Ala Arg Gly Val 
                245                 250                 255 

CAC GCC CTG GCC GAA ACC TGC ACA CAA TAC CTC TAC CTC ACC AAG  810 
His Ala Leu Ala Glu Thr Cys Thr Gln Tyr Leu Tyr Leu Thr Lys 
                260                 265                 270 

GAC GAC CTG GAG CGG CCG GAT TTC GAG GGC GCG AAG TAT GTT TTC  855 
Asp Asp Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Phe 
                275                 280                 285 

ACC CCG CCT CCG CGC ACG AAG AAG GAC CAG GAA ATC CTC TGG AAC  900 
Thr Pro Pro Pro Arg Thr Lys Lys Asp Gln Glu Ile Leu Trp Asn 
                290                 295                 300 

GCA CTC CGG AAC GGG GTC CTC GAA ACG GTC TCC TCG GAC CAT TGT  945 
Ala Leu Arg Asn Gly Val Leu Glu Thr Val Ser Ser Asp His Cys 
                305                 310                 315 

TCC TGG CTC TTC GAG GGG CAC AAG GAT CGG GGC AGG AAC GAC TTC  990 
Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe 
                320                 325                 330 

CGC GCC ATC CCG AAC GGA GCG CCG GGC GTC GAG GAG CGG CTG ATG  1035 
Arg Ala Ile Pro Asn Gly Ala Pro Gly Val Glu Glu Arg Leu Met 
                335                 340                 345 

ATG GTC TAT CAG GGC GTC AAC GAA GGC CGC ATT TCC CTC ACC CAG  1080 
Met Val Tyr Gln Gly Val Asn Glu Gly Arg Ile Ser Leu Thr Gln 
                350                 355                 360 

TTC GTA GAA CTG GTC GCC ACG CGC CCG GCC AAG GTC TTC GGC ATG  1125 
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met 
                365                 370                 375 

TTC CCG GAA AAA GGA ACG GTC GCG GTC GGT TCG GAT GCC GAC ATC  1170 
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp Ile 
                380                 385                 390 

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTG ATC GAA CAA AGC GCC  1215 
Val Leu Trp Asp Pro Glu Ala Glu Met Val Ile Glu Gln Ser Ala 
                395                 400                 405 

ATG CAT AAC GCC ATG GAT TAC TCC TCC TAC GAG GGA CAC AAG ATC  1260 
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys Ile 
                410                 415                 420 

AAG GGC GTG CCG AAG ACA GTG CTG CTG CGT GGC AAG GTG ATC GTC  1305 
Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val Ile Val 
                425                 430                 435 

GAC GAG GGA ACC TAC GTG GGG GCG CCG ACG GAT GGC CAG TTC CGG  1350 
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gln Phe Arg 
                440                 445                 450 

AAG CGC CGC AAA TAC AAG CAA TAA  1374 
Lys Arg Arg Lys Tyr Lys Gln STOP 
                455 
SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 
(A) NAME: ENIRICERCHE S.p.A. 
(B) STREET: VIA MARITANO, 26 
(C) CITY: SAN DONATO MILANESE (MILANO) 
(E) COUNTRY: ITALY 
(F) POSTAL CODE: 20097 
TELEFAX: 02/52036344 
(ii) TITLE OF INVENTION: 
PROCESS FOR THE PRODUCTION OF D-N-ALFA AMINO ACIDS 
(iii) NUMBER OF SEQUENCES: 19 
(iv) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
COMPUTER: IBM PC-Compatible 
OPERATING SYSTEM: IBM-DOS 5.2 / WINDOWS 3.1 
SOFTWARE: DisplayWrite 4 
(1) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 2 3 base pairs ^^:^> 

(B) TYPE: Nucleic acid 

so (C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

35 CGAATTGTAA ATTATGCAGC AGC 2 3 

(1) INFORMATION FOR SEQ ID NO:2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA (Genomic) 

(xi) SEQUENCE DESCRIPTION: 

AGGATCGTGA ACTACGCGGC GGC 23 

(1) INFORMATION FOR SEQ ID NO: 3: 
50 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGHT: 2 3 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
( ii). MOLECULE TYPE: DNA (Genomic) 
{xi) SEQUENCE DESCRIPTION: 

CGCATAGTCA A TT ATGCCGC CGC 23 
(1) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGTATTGTTA ATTATGCTGC TGC 2 3 

(1) INFORMATION FOR SEQ ID NO : 5 : 
25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(Xi) SEQUENCE DESCRIPTION: 
GGACCAATTC AACGAGC 17 
(1) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
45 (xi) SEQUENCE DESCRIPTION: 

GGGCCGATCC AGCGGGC 17 
(1) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGHT: 17 base pairs 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
GGCCCCATAC AACGCGC 17 
(1) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

'5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
GGTCCTATTC AACGTGC 17 
(1) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA (Genomic) 

(xi) SEQUENCE DESCRIPTION: 

CGAGCAGATG TAATGGA 17 
(1) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

AGGGCGGACG TGATGGA 17 
(1) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGCGCCGATG TCATGGA 17 
(1) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs -; : iy. V;^ 

(B) TYPE: Nucleic acid W#Mh^ 

(C) STRANDEDNESS: Single £B^-V. 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGTGCTGATG TTATGGA 17 
(1) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 27 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

ATCGTAACCG CGGACGGGAT TTCTCCC 2 7 

(1) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 22 base pairs h %: 

(B) TYPE: Nucleic acid ^ 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear U <> ^P, 

(ii) MOLECULE TYPE: DNA (Genomic) . 

(ix) FEATURE: \ • 

(A) NAME: Primer 

(xi) SEQUENCE DESCRIPTION: 

GGG A ATTCTT ATGACACGTC AG 22 .A 

(1) INFORMATION FOR SEQ ID NO: 15: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGHT: 23 base pairs -*> 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 

<ix) FEATURE : 

(A) NAME: Primer 

(xi) SEQUENCE DESCRIPTION: 

CCCAAGCTTC AAAATTCCGC GAT 2 3 

(1) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 13 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
<ix) FEATURE: 

(A) NAME: Linker 

(xi) SEQUENCE DESCRIPTION: 

AATTCTTATG GAT 13 

(1) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 76 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Linker 

(xi) SEQUENCE DESCRIPTION: 

TCGACGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA 
GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGG 
(1) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 915 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aarobacter ium radiobacter 
(xi) SEQUENCE DESCRIPTION: 

ATG AC A CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC 45 
Met Thr Arg Gin Met He Leu Ala Val Gly Gin Gin Gly Pro He 

5 10 15 

GCG CGA GCG GAG ACA CGC GAA CAG GTG GTT GGC CGC CTC CTC GAC 90 % 
Ala Arg Ala Glu Thr Arg Glu Gin Val Val Gly Arg Leu Leu Asp ^/'^f- 
2 0 25 30 *kfev ! . : 

ATG TTG ACG AAC GCA GCC AGC CGG GGC GTG AAC TTC ATC GTC TTT 135 |: . 

Met Leu Thr Asn Ala Ala Ser Arg Gly val Asn Phe He Val Phe 

35 4 0 4 5 P: :L 

20 CCC GAG CTT GCG CTC ACG ACC TTC TTC CCG CGC TGG CAT TTC ACC 180 
Pro Glu Leu Ala Leu Thr Thr Phe Phe Pro Arg Trp His Phe Thr 

50 55 60 • • 

GAC GAG GCC GAG CTC GAT AGC TTC TAT GAG ACC GAA ATG CCC GGC 225 

Asp Glu Ala Glu Leu Asp Ser Phe Tyr Glu Thr Glu Met Pro Gly 1 

65 70 75 

CCG GTG GTC CGT CCA CTC TTT GAG ACG GCC GCC GAA CTC GGG ATC 270 
Pro Val Val Arg Pro Leu Phe Glu Thr Ala Ala Glu Leu Gly He 

80 85 90 

GGC TTC AAT CTG GGC TAC GCC GAA CTC GTC GTC GAA GGC GGC GTC 315 
Gly Phe Asn Leu Gly Tyr Ala Glu Leu Val Val Glu Gly Gly Val 
95 100 105 

AAG CGT CGC TTC AAC ACG TCC ATT CTG GTG GAT AAG TCA GGC AAG 360 
Lys Arg Arg Phe Asn Thr Ser He Leu Val Asp Lys Ser Gly Lys 
110 115 120 

40 ATC GTC GGC AAG TAT CGT AAG ATC CAT TTG CCG GGT CAC AAG GAG 405 
lie Val Gly Lys Tyr Arg Lys He His Leu Pro Gly Hys Lys Glu 
125 130 135 

TAC GAG GCC TAC CGG CCG TTC CAG CAT CTT GAA AAG CGT TAT TTC 450 
Tyr Glu Ala Tyr Arg Pro Phe Gin His Leu Glu Lys Arg Tyr Phe 
140 145 150 

GAG CCG GGC GAT CTC GGC TTC CCG GTC TAT GAC GTC GAC GCC GCG 495 
Glu Pro Gly Asp Leu Gly Phe Pro Val Tyr Asp Val Asp Ala Ala 
155 160 165 
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AAA ATG GCG ATG TTC ATC TGC AAC GAT CGC CGC TGG CCT GAA ACG 54 0 r " : — 

Lys Met Gly Met Phe He Cys Asn Asp Arg Arg Trp Pro Glu Thr 
170 175 180 

TGG CGG GTG ATG GGA CTT AAG GGC GCC GAG ATC ATC TGC GGC GGC 58 5 
Trp Arg Val Met Gly Leu Lys Gly Ala Glu He He Cys Gly Gly 
185 190 195 

TAC AAC ACG CCG ACC CAC AAT CCC CCC GTT CCC CAG CAC GAC CAT 6 30 
Tyr Asn Thr Pro Thr His Asn Pro Pro Val Pro Gin His Asp His 

200 205 210 -..: : >v 

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GGG TCG TAC 675 ■■Vi-" * 

Leu Thr Ser Phe His His Leu Leu Ser Met Gin Ala Gly Ser Tyr 

215 220 225 v*. : ^tC: 

CAA AAC GGC GCC TGG TCC GCG GCG GCC GGC AAG GTC GGC ATG GAG 720 

Gin Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu ■■-■^J' : C-- 

230 235 * 240 

GAG GGG TGC ATG CTG CTC GGC CAT TCG TGC ATC GTG GCG CCG ACC 765 
Glu Gly Cys Met Leu Leu Gly His Ser Cys He Val Ala Pro Thr 
245 250 255 

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTG GAA GAC GAG GTG ATC 810 
Gly Glu He Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val lie 
260 265 270 

30 ACC GCC GCC GTC GAT CTC GAC CGC TGC CGG GAA CTG CGC GAA CAC 855 

Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His 
275 280 285 

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAC GGT CTG 900 
35 He Phe Asn Phe Lys Ala His Arg Gin Pro Gin His Tyr Gly Leu 

290 295 300 

ATC GCG GAA TTT TGA 915 
^ He Ala Glu Phe 

(1) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: iv 
(A) LENGHT: 1374 base pairs 

45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE 



10 



15 



20 



25 



31 



BNSDOCID: <EP 0677585A1J > 



EP 0 677 585 A1 



10 



20 



25 



35 



40 



45 



(A) ORGANISM: Aarobacterium radiobacter 
(xi) SEQUENCE DESCRIPTION: 

ATG GAT ATC ATC ATC AAG AAC GGA ACC ATC GTA ACC GCG GAC GGG 45 
Met Asp lie lie lie Lys Asn Gly Thr lie Val Thr Ala Asp Gly 

5 10 15 

ATT TCT CCC GCC GAT CTC GGA ATC AAG GAT GGC AAG ATC GCC CAG 90 
He Ser Pro Ala Asp Leu Gly He Lys Asp Gly Lys He Ala Gin 

20 25 30 

ATC GGC GGA ACG TTC GGC CCG GCC GGC CGG ACA ATC GAC GCC TCC 135
Ile Gly Gly Thr Phe Gly Pro Ala Gly Arg Thr Ile Asp Ala Ser
35 40 45

GGC CGC TAC GTT TTT CCG GGC GGC ATC GAC GTT CAT ACG CAT GTC 180
Gly Arg Tyr Val Phe Pro Gly Gly Ile Asp Val His Thr His Val
50 55 60

GAG ACG GTC AGC TTC AAC ACG CAG TCG GCC GAC ACA TTC GCA ACC 225
Glu Thr Val Ser Phe Asn Thr Gln Ser Ala Asp Thr Phe Ala Thr
65 70 75

GCG ACG GTC GCG GCC GCC TGT GGC GGC ACG ACG ACC ATC GTC GAT 270
Ala Thr Val Ala Ala Ala Cys Gly Gly Thr Thr Thr Ile Val Asp
80 85 90

TTC TGC CAG CAG GAC CGC GGC CAT AGC CTG AGG GAG GCG GTC GCC 315
Phe Cys Gln Gln Asp Arg Gly His Ser Leu Arg Glu Ala Val Ala
95 100 105

AAA TGG GAC GGC ATG GCC GGC GGC AAG TCG GCG ATC GAC TAC GGC 360
Lys Trp Asp Gly Met Ala Gly Gly Lys Ser Ala Ile Asp Tyr Gly
110 115 120

TAC CAT ATC ATC GTG CTC GAT CCG ACT GAT AGC GTG ATC GAG GAG 405
Tyr His Ile Ile Val Leu Asp Pro Thr Asp Ser Val Ile Glu Glu
125 130 135

CTA GAG GTA CTG CCA GAT CTC GGC ATC ACC TCC TTC AAG GTC TTC 450
Leu Glu Val Leu Pro Asp Leu Gly Ile Thr Ser Phe Lys Val Phe
140 145 150

ATG GCT TAT CGC GGC ATG AAC ATG ATC GAC GAC GTG ACA CTG CTC 495
Met Ala Tyr Arg Gly Met Asn Met Ile Asp Asp Val Thr Leu Leu
155 160 165

AGG ACG CTC GAC AAG GCC GCC AAG ACT GGG TCA CTC GTC ATG GTG 540
Arg Thr Leu Asp Lys Ala Ala Lys Thr Gly Ser Leu Val Met Val
170 175 180

CAC GCG GAG AAC GGC GAC GCC GCC GAC TAT CTT CGC GAC AAG TTC 585
His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg Asp Lys Phe
185 190 195

GTC GCC GAT GGC AAA ACG GCG CCG ATC TAC CAC GCG CTC AGC CGT 630
Val Ala Asp Gly Lys Thr Ala Pro Ile Tyr His Ala Leu Ser Arg
200 205 210

CCG CCC CGG GTC GAA GCC GAG GCG ACC GCG AGG GCC CTC GCC CTG 675
Pro Pro Arg Val Glu Ala Glu Ala Thr Ala Arg Ala Leu Ala Leu
215 220 225

GCG GAA ATC GTC AAC GCC CCG ATC TAC ATC GTG CAT CTG ACC TGC 720
Ala Glu Ile Val Asn Ala Pro Ile Tyr Ile Val His Leu Thr Cys
230 235 240

GAA GAA TCC TTC GAC GAG TTG ATG CGG GCA AAG GCT CGG GGT GTC 765
Glu Glu Ser Phe Asp Glu Leu Met Arg Ala Lys Ala Arg Gly Val
245 250 255

CAC GCC CTG GCC GAA ACC TGC ACA CAA TAC CTC TAC CTC ACC AAG 810
His Ala Leu Ala Glu Thr Cys Thr Gln Tyr Leu Tyr Leu Thr Lys
260 265 270

GAC GAC CTG GAG CGG CCG GAT TTC GAG GGC GCG AAG TAT GTT TTC 855
Asp Asp Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Phe
275 280 285

ACC CCG CCT CCG CGC ACG AAG AAG GAC CAG GAA ATC CTC TGG AAC 900
Thr Pro Pro Pro Arg Thr Lys Lys Asp Gln Glu Ile Leu Trp Asn
290 295 300

GCA CTC CGG AAC GGG GTC CTC GAA ACG GTC TCC TCG GAC CAT TGT 945
Ala Leu Arg Asn Gly Val Leu Glu Thr Val Ser Ser Asp His Cys
305 310 315

TCC TGG CTC TTC GAG GGG CAC AAG GAT CGG GGC AGG AAC GAC TTC 990
Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe
320 325 330

CGC GCC ATC CCG AAC GGA GCG CCG GGC GTC GAG GAG CGG CTG ATG 1035
Arg Ala Ile Pro Asn Gly Ala Pro Gly Val Glu Glu Arg Leu Met
335 340 345

ATG GTC TAT CAG GGC GTC AAC GAA GGC CGC ATT TCC CTC ACC CAG 1080
Met Val Tyr Gln Gly Val Asn Glu Gly Arg Ile Ser Leu Thr Gln
350 355 360

TTC GTA GAA CTG GTC GCC ACG CGC CCG GCC AAG GTC TTC GGC ATG 1125
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met
365 370 375

TTC CCG GAA AAA GGA ACG GTC GCG GTC GGT TCG GAT GCC GAC ATC 1170
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp Ile
380 385 390

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTG ATC GAA CAA AGC GCC 1215
Val Leu Trp Asp Pro Glu Ala Glu Met Val Ile Glu Gln Ser Ala
395 400 405

ATG CAT AAC GCC ATG GAT TAC TCC TCC TAC GAG GGA CAC AAG ATC 1260
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys Ile
410 415 420

AAG GGC GTG CCG AAG ACA GTG CTG CTG CGT GGC AAG GTG ATC GTC 1305
Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val Ile Val
425 430 435

GAC GAG GGA ACC TAC GTG GGG GCG CCG ACG GAT GGC CAG TTC CGG 1350
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gln Phe Arg
440 445 450

AAG CGC CGC AAA TAC AAG CAA TAA 1374
Lys Arg Arg Lys Tyr Lys Gln
455
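
The codon rows and residue rows of the sequence above can be cross-checked mechanically. The following is a minimal Python sketch, assuming that the standard genetic code applies to the listed reading frame; the names `CODON_TABLE`, `translate_row` and `mismatches` are hypothetical helpers introduced here for illustration and do not form part of the described process. The example row is the one ending at nucleotide 585.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_1)}

# One-letter to three-letter residue names, matching the style of the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "STOP",
}


def translate_row(codon_row: str) -> list[str]:
    """Translate a space-separated row of codons into three-letter residues."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codon_row.upper().split()]


def mismatches(codon_row: str, residue_row: str) -> list[tuple[int, str, str, str]]:
    """Return (index, codon, printed residue, translated residue) where the rows disagree."""
    codons = codon_row.upper().split()
    printed = residue_row.split()
    translated = translate_row(codon_row)
    return [
        (i, c, p, t)
        for i, (c, p, t) in enumerate(zip(codons, printed, translated))
        if p != t
    ]


row_dna = "CAC GCG GAG AAC GGC GAC GCC GCC GAC TAT CTT CGC GAC AAG TTC"
row_aa = "His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg Asp Lys Phe"
print(mismatches(row_dna, row_aa))  # an empty list means the two rows agree
```

A non-empty result points at the position where a printed residue and its codon disagree, which is how a misprinted codon or residue in a row of this kind would be located.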



Claims 

1. A process for the production of D-α-amino acids by the stereospecific conversion of racemic mixtures 
of 5-substituted hydantoins characterized in that, the conversion reaction is carried out in the presence 
of a microorganism transformed with the plasmid pSM651 CBS 203.94 capable of expressing at high 
levels and without inducers an enzymatic system capable of converting said hydantoins into the 
corresponding D-α-amino acids. 

2. The process according to claim 1, characterized in that, the conversion reaction is carried out in the 
presence of the enzymatic system isolated from a microorganism transformed with the plasmid 
pSM651 CBS 203.94. 

3. The process according to claim 2, characterized in that, said enzymatic system is immobilized on an 
insoluble support. 

4. The process according to claim 1, characterized in that, the microorganisms are selected from the 
group of Bacillus subtilis and Escherichia coli. 

5. The process according to claim 1, characterized in that, the 5-substituted hydantoin is selected from 
D,L-5-phenylhydantoin, D,L-5-para-hydroxyphenylhydantoin, D,L-5-methylhydantoin, 
D,L-5-isopropylhydantoin, D,L-5-thienylhydantoin, D,L-5-para-methoxyphenylhydantoin, 
D,L-5-para-chlorophenylhydantoin, D,L-5-benzylhydantoin. 

6. The process according to claim 5, characterized in that, the hydantoin is D,L-5-para-hydroxyphenylhydantoin. 

7. The process according to claim 5, characterized in that, the hydantoin is D,L-5-phenylhydantoin. 

8. The process according to claim 1, characterized in that, the conversion reaction is carried out at a 
temperature of between 20°C and 60°C. 

9. The process according to claim 8, characterized in that, the temperature is between 30°C and 45°C. 

10. The process according to claim 1, characterized in that, the conversion reaction is carried out at a pH 
of between 6.0 and 10. 

11. The process according to claim 10, characterized in that, the pH is between 7.0 and 8.5. 

12. The process according to claim 1, characterized in that, the conversion reaction is carried out using a 
weight ratio biomass/hydantoins of between 1/1 and 1/50. 

13. Plasmid pSM651 deposited at the Bureau Voor Schimmelcultures, SK Baarn (Holland) where it has 
received the deposit number CBS 203.94. 

14. A microorganism selected from Bacillus subtilis and Escherichia coli transformed with the plasmid 
pSM651. 

15. The microorganism according to claim 14, which is Escherichia coli SMC305 CBS 203.94. 
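
The numerical ranges recited in claims 8 to 12 can be summarised as a simple check. The following is a minimal Python sketch under stated assumptions: the class `ReactionConditions` and the functions below are hypothetical names introduced for illustration, the range limits are treated as inclusive, and the example values (40°C, pH 8.0, ratio 1/20) are illustrative only and are not taken from the examples of the specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReactionConditions:
    """Hypothetical container for one set of conversion conditions."""
    temperature_c: float          # reaction temperature in degrees Celsius
    ph: float                     # pH of the reaction mixture
    biomass_per_hydantoin: float  # weight ratio biomass/hydantoins, e.g. 1/10


def in_range(value: float, low: float, high: float) -> bool:
    # Assumption: the claimed limits are treated as inclusive.
    return low <= value <= high


def within_claimed_ranges(c: ReactionConditions) -> bool:
    """Ranges of claims 8, 10 and 12."""
    return (
        in_range(c.temperature_c, 20.0, 60.0)
        and in_range(c.ph, 6.0, 10.0)
        and in_range(c.biomass_per_hydantoin, 1 / 50, 1 / 1)
    )


def within_narrower_ranges(c: ReactionConditions) -> bool:
    """Narrower temperature and pH ranges of claims 9 and 11 (ratio as in claim 12)."""
    return (
        within_claimed_ranges(c)
        and in_range(c.temperature_c, 30.0, 45.0)
        and in_range(c.ph, 7.0, 8.5)
    )


example = ReactionConditions(temperature_c=40.0, ph=8.0, biomass_per_hydantoin=1 / 20)
print(within_claimed_ranges(example), within_narrower_ranges(example))
```

The weight ratio is expressed as a single number (biomass divided by hydantoins), so the range of claim 12 corresponds to values from 1/50 up to 1/1.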
FIG. 4A

ATG ACA CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC 45
Met Thr Arg Gln Met Ile Leu Ala Val Gly Gln Gln Gly Pro Ile
5 10 15

GCG CGA GCG GAG ACA CGC GAA CAG GTG GTT GGC CGC CTC CTC GAC 90
Ala Arg Ala Glu Thr Arg Glu Gln Val Val Gly Arg Leu Leu Asp
20 25 30

ATG TTG ACG AAC GCA GCC AGC CGG GGC GTG AAC TTC ATC GTC TTT 135
Met Leu Thr Asn Ala Ala Ser Arg Gly Val Asn Phe Ile Val Phe
35 40 45

CCC GAG CTT GCG CTC ACG ACC TTC TTC CCG CGC TGG CAT TTC ACC 180
Pro Glu Leu Ala Leu Thr Thr Phe Phe Pro Arg Trp His Phe Thr
50 55 60

GAC GAG GCC GAG CTC GAT AGC TTC TAT GAG ACC GAA ATG CCC GGC 225
Asp Glu Ala Glu Leu Asp Ser Phe Tyr Glu Thr Glu Met Pro Gly
65 70 75

CCG GTG GTC CGT CCA CTC TTT GAG ACG GCC GCC GAA CTC GGG ATC 270
Pro Val Val Arg Pro Leu Phe Glu Thr Ala Ala Glu Leu Gly Ile
80 85 90

GGC TTC AAT CTG GGC TAC GCC GAA CTC GTC GTC GAA GGC GGC GTC 315
Gly Phe Asn Leu Gly Tyr Ala Glu Leu Val Val Glu Gly Gly Val
95 100 105

AAG CGT CGC TTC AAC ACG TCC ATT CTG GTG GAT AAG TCA GGC AAG 360
Lys Arg Arg Phe Asn Thr Ser Ile Leu Val Asp Lys Ser Gly Lys
110 115 120

ATC GTC GGC AAG TAT CGT AAG ATC CAT TTG CCG GGT CAC AAG GAG 405
Ile Val Gly Lys Tyr Arg Lys Ile His Leu Pro Gly His Lys Glu
125 130 135

TAC GAG GCC TAC CGG CCG TTC CAG CAT CTT GAA AAG CGT TAT TTC 450
Tyr Glu Ala Tyr Arg Pro Phe Gln His Leu Glu Lys Arg Tyr Phe
140 145 150

FIG. 4B

GAG CCG GGC GAT CTC GGC TTC CCG GTC TAT GAC GTC GAC GCC GCG 495
Glu Pro Gly Asp Leu Gly Phe Pro Val Tyr Asp Val Asp Ala Ala
155 160 165

AAA ATG GGG ATG TTC ATC TGC AAC GAT CGC CGC TGG CCT GAA ACG 540
Lys Met Gly Met Phe Ile Cys Asn Asp Arg Arg Trp Pro Glu Thr
170 175 180

TGG CGG GTC ATG GGA CTT AAG GGC GCC GAG ATC ATC TGC GGC GGC 585
Trp Arg Val Met Gly Leu Lys Gly Ala Glu Ile Ile Cys Gly Gly
185 190 195

TAC AAC ACG CCC ACC CAC AAT CCC CCC GTT CCC CAG CAC GAC CAT 630
Tyr Asn Thr Pro Thr His Asn Pro Pro Val Pro Gln His Asp His
200 205 210

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GGG TCG TAC 675
Leu Thr Ser Phe His His Leu Leu Ser Met Gln Ala Gly Ser Tyr
215 220 225

CAA AAC GGC GCC TGG TCC GCG GCG GCC GGC AAG GTC GGC ATG GAG 720
Gln Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu
230 235 240

GAG GGG TGC ATG CTG CTC GGC CAT TCG TGC ATC GTG GCC CCG ACC 765
Glu Gly Cys Met Leu Leu Gly His Ser Cys Ile Val Ala Pro Thr
245 250 255

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTG GAA GAC GAG GTG ATC 810
Gly Glu Ile Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val Ile
260 265 270

ACC GCC GCC GTC GAT CTC GAC CGC TGC CGG GAA CTG CGC GAA CAC 855
Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His
275 280 285

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAC GGT CTC 900
Ile Phe Asn Phe Lys Ala His Arg Gln Pro Gln His Tyr Gly Leu
290 295 300

ATC GCC GAA TTT TGA 915
Ile Ala Glu Phe STOP

FIG. 5A

ATG GAT ATC ATC ATC AAG AAC GGA ACC ATC GTA ACC GCG GAC GGG 45
Met Asp Ile Ile Ile Lys Asn Gly Thr Ile Val Thr Ala Asp Gly
5 10 15

ATT TCT CCC GCC GAT CTC GGA ATC AAG GAT GGC AAG ATC GCC CAG 90
Ile Ser Pro Ala Asp Leu Gly Ile Lys Asp Gly Lys Ile Ala Gln
20 25 30

ATC GGC GGA ACG TTC GGC CCG GCC GGC CGG ACA ATC GAC GCC TCC 135
Ile Gly Gly Thr Phe Gly Pro Ala Gly Arg Thr Ile Asp Ala Ser
35 40 45

GGC CGC TAC GTT TTT CCG GGC GGC ATC GAC GTT CAT ACG CAT GTC 180
Gly Arg Tyr Val Phe Pro Gly Gly Ile Asp Val His Thr His Val
50 55 60

GAG ACG GTC AGC TTC AAC ACG CAG TCG GCC GAC ACA TTC GCA ACC 225
Glu Thr Val Ser Phe Asn Thr Gln Ser Ala Asp Thr Phe Ala Thr
65 70 75

GCG ACG GTC GCG GCC GCC TGT GGC GGC ACG ACG ACC ATC GTC GAT 270
Ala Thr Val Ala Ala Ala Cys Gly Gly Thr Thr Thr Ile Val Asp
80 85 90

TTC TGC CAG CAG GAC CGC GGC CAT AGC CTG AGG GAG GCG GTC GCC 315
Phe Cys Gln Gln Asp Arg Gly His Ser Leu Arg Glu Ala Val Ala
95 100 105

AAA TGG GAC GGC ATG GCC GGC GGC AAG TCG GCG ATC GAC TAC GGC 360
Lys Trp Asp Gly Met Ala Gly Gly Lys Ser Ala Ile Asp Tyr Gly
110 115 120

TAC CAT ATC ATC GTG CTC GAT CCG ACT GAT AGC GTG ATC GAG GAG 405
Tyr His Ile Ile Val Leu Asp Pro Thr Asp Ser Val Ile Glu Glu
125 130 135

CTA GAG GTA CTG CCA GAT CTC GGC ATC ACC TCC TTC AAG GTC TTC 450
Leu Glu Val Leu Pro Asp Leu Gly Ile Thr Ser Phe Lys Val Phe

FIG. 5B

140 145 150

ATG GCT TAT CGC GGC ATG AAC ATG ATC GAC GAC GTG ACA CTG CTC 495
Met Ala Tyr Arg Gly Met Asn Met Ile Asp Asp Val Thr Leu Leu
155 160 165

AGG ACG CTC GAC AAG GCC GCC AAG ACT GGG TCA CTC GTC ATG GTG 540
Arg Thr Leu Asp Lys Ala Ala Lys Thr Gly Ser Leu Val Met Val
170 175 180

CAC GCG GAG AAC GGC GAC GCC GCC GAC TAT CTT CGC GAC AAG TTC 585
His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg Asp Lys Phe
185 190 195

GTC GCC GAT GGC AAA ACG GCG CCG ATC TAC CAC GCG CTC AGC CGT 630
Val Ala Asp Gly Lys Thr Ala Pro Ile Tyr His Ala Leu Ser Arg
200 205 210

CCG CCC CGG GTC GAA GCC GAG GCG ACC GCG AGG GCC CTC GCC CTG 675
Pro Pro Arg Val Glu Ala Glu Ala Thr Ala Arg Ala Leu Ala Leu
215 220 225

GCG GAA ATC GTC AAC GCC CCG ATC TAC ATC GTG CAT CTG ACC TGC 720
Ala Glu Ile Val Asn Ala Pro Ile Tyr Ile Val His Leu Thr Cys
230 235 240

GAA GAA TCC TTC GAC GAG TTG ATG CGG GCA AAG GCT CGG GGT GTC 765
Glu Glu Ser Phe Asp Glu Leu Met Arg Ala Lys Ala Arg Gly Val
245 250 255

CAC GCC CTG GCC GAA ACC TGC ACA CAA TAC CTC TAC CTC ACC AAG 810
His Ala Leu Ala Glu Thr Cys Thr Gln Tyr Leu Tyr Leu Thr Lys
260 265 270

GAC GAC CTG GAG CGG CCG GAT TTC GAG GGC GCG AAG TAT GTT TTC 855
Asp Asp Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Phe
275 280 285

ACC CCG CCT CCG CGC ACG AAG AAG GAC CAG GAA ATC CTC TGG AAC 900
Thr Pro Pro Pro Arg Thr Lys Lys Asp Gln Glu Ile Leu Trp Asn
290 295 300

GCA CTC CGG AAC GGG GTC CTC GAA ACG GTC TCC TCG GAC CAT TGT 945
Ala Leu Arg Asn Gly Val Leu Glu Thr Val Ser Ser Asp His Cys

FIG. 5C

305 310 315

TCC TGG CTC TTC GAG GGG CAC AAG GAT CGG GGC AGG AAC GAC TTC 990
Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe
320 325 330

CGC GCC ATC CCG AAC GGA GCG CCG GGC GTC GAG GAG CGG CTG ATG 1035
Arg Ala Ile Pro Asn Gly Ala Pro Gly Val Glu Glu Arg Leu Met
335 340 345

ATG GTC TAT CAG GGC GTC AAC GAA GGC CGC ATT TCC CTC ACC CAG 1080
Met Val Tyr Gln Gly Val Asn Glu Gly Arg Ile Ser Leu Thr Gln
350 355 360

TTC GTA GAA CTG GTC GCC ACG CGC CCG GCC AAG GTC TTC GGC ATG 1125
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met
365 370 375

TTC CCG GAA AAA GGA ACG GTC GCG GTC GGT TCG GAT GCC GAC ATC 1170
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp Ile
380 385 390

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTG ATC GAA CAA AGC GCC 1215
Val Leu Trp Asp Pro Glu Ala Glu Met Val Ile Glu Gln Ser Ala
395 400 405

ATG CAT AAC GCC ATG GAT TAC TCC TCC TAC GAG GGA CAC AAG ATC 1260
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys Ile
410 415 420

AAG GGC GTG CCG AAG ACA GTG CTG CTG CGT GGC AAG GTG ATC GTC 1305
Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val Ile Val
425 430 435

GAC GAG GGA ACC TAC GTG GGG GCG CCG ACG GAT GGC CAG TTC CGG 1350
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gln Phe Arg
440 445 450

AAG CGC CGC AAA TAC AAG CAA TAA 1374
Lys Arg Arg Lys Tyr Lys Gln STOP
455